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Abstract. This is an introductory level survey of some topics from a new branch of 
fractal analysis — the theory of self-similar groups. We discuss recent works on random 
walks on self-similar groups and their applications to the problem of amenability for 
these groups. 

'd^" 
O 
O 

^ : Introduction 

O-i- The purpose of this paper is to give a brief survey of some ideas and methods associated 

. with new progress in understanding the so-called self-similar groups. This class of groups 
consists of automorphisms of homogeneous rooted trees defined in a simple recursive way. 
A good account of the initial period of the theory can be found in the survey |BGN03j by 
' Bartholdi, Grigorchuk and Nekrashevych and in the recent monograph of Nekrashevych 
^ . |Nek05] (their authors are among the most active contributors to this field). 

I Self-similar groups have a natural interpretation in terms of fractal geometry; their limit 

sets are very interesting fractal sets (see the recent papers |NT08[ IRT09] for a study of 
Laplacians on limit sets). These groups also arise as iterated monodromy groups of rational 
endomorphisms of the Riemann sphere, which, for instance, led to a recent solution of an 
old problem from rational dynamics |BN06j . 
>• ' Self-similar groups are often quite unusual from the point of view of the traditional 

group theory. This has both advantages and disadvantages. On one hand, by using self- 
'. similar groups it is easy to construct examples which may otherwise be much less accessible 
O I (for instance, the famous Grigorchuk group of intermediate growth [GriSOl IGri85l IdlHOO] 
'nI" ■ has a very simple self-similar presentation). On the other hand, even the simplest group 
theoretical questions for self-similar groups may be quite hard. Already finding a self- 
O ! similar realization of a free group is very far from being obvious |BS98l IGM051 IV VP 7] . 

Another question of this kind is whether a given self-similar group is amenable (amena- 
bility introduced by von Neumann |vN29] is, in a sense, the most natural generalization 
of finiteness, and it plays a fundamental role in group theory). There are numerous char- 
5^ ! acterizations of amenability in various terms. In particular, it is known to be equivalent 
to existence of a random walk on the group with trivial behaviour at infinity (= trivial 
Poisson boundary) [Fur 731 [RosSll [KVSSj . It turns out that self-similar groups may have 
random walks which are also self-similar in a certain sense, and it is this self-similarity that 
can be used in order to prove triviality of the Poisson boundary, and therefore establish 
amenability of the underlying group. 

This idea was first used by Bartholdi and Virag [BV05] for proving amenability of the so- 
called Basilica group B. This group first studied by Grigorchuk and Zuk |GZ02aj has a very 
simple matrix presentation and also arises as the iterated monodromy group of the map 
z z"^ — 1 (known as the Basilica map, whence the name). In particular, Grigorchuk and 
Zuk proved that B is not subexponentially elementary, which made especially interesting 
the question about its amenability. The approach of Bartholdi and Virag was further 
developed by Kaimanovich |Kai05] who used the entropy theory of random walks (which 
provides a simple criterion of triviality of the Poisson boundary) in combination with a 
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contraction property for the asymptotic entropy of random walks on self-similar groups 
(the "Miinchhausen trick"). 

An important ingredient of this technique is a link (established in |Kai05j ) between 
self- similarity and the so-called random walks with internal degrees of freedom (RWIDF) 
[KS83j also known under the names of matrix-valued random walks [CW89j or of covering 
Markov chains jKai95] . These are group invariant Markov chains which take place on the 
product of a group by a certain parameter set. Any random walk on a self-similar group 
naturally gives rise to a random walk with internal degrees of freedom parameterized by 
the alphabet of the action tree of the group. In turn, this RWIDF, when restricted to the 
copy of the group corresponding to a fixed value of the freedom parameter, produces a new 
random walk on the self-similar group (as it is pointed out in |GN07j . this transformation 
corresponds to the classical operation of taking the Schur complement of a matrix). It is 
the interplay between the original and the new random walks, which allows one to apply 
the Miinchhausen trick. 

This technique was recently applied by Bartholdi, Kaimanovich and Nekrashevych 
|BKN08] to prove amenability of all self-similar groups generated by bounded automata 
(this class, in particular, contains the Basilica group). 

In this survey we attempt to give a historical and conceptual overview of these devel- 
opments without going into technical details, so that it should hopefully be suitable for 
a first acquaintance with the subject. Structurally, our presentation is split into three 
parts. In Section [T] we discuss the notion of self-similarity, introduce self-similar groups in 
general and the subclass of self-similar groups generated by bounded automata. Further 
in Section [2] we briefly discuss the notion of amenability of a group. Finally, in Section [3] 
we analyze random walks on self-similar groups, and show how they can be used for 
establishing amenability of self-similar groups. 

The presentation is based on a talk given at the "Fractal Geometry and Stochastics IV" 
conference (and on several other occasions as well). I would like to thank the organizers 
of this meeting for a very interesting and inspiring week in Greifswald. 

1. Self-similar groups 

l.A. And so ad infinitum . . . The modern idea of self-similarity is best described by 
the following quote from On Poetry: a Rhapsody by Jonathan Swift (1733)0: 

So, naturalists observe, a flea 
Has smaller fleas that on him prey; 
And these have smaller still to bite 'em. 
And so proceed ad infinitum. 

On a more formal level, the simplest self-similarity assumptions are: 

• a part is similar to the whole; 

• the whole is a union of such parts; 

• these parts are pairwise disjoint. 



It was naturally extended by Augustus de Morgan in his Budget of Paradoxes (1872): 

Great fleas have httle fleas upon their backs to bite 'em, 

And Httle fleas have lesser fleas, and so ad infinitum. 

And the great fleas themselves, in turn have, greater fleas to go on; 

While these again have greater still, and greater still, and so on. 

This is a description of what is nowadays called the natural extension of a non-invertible dynamical 
system. See jKai03| for its applications in the context of fractal sets. 
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These assumptions naturally lead to the most fundamental self-similar structure, that 
of a rooted homogeneous tree. Such trees are, for instance, skeletons of iterated function 
systems satisfying the open set condition (e.g., see |Fal03] ) . the simplest one of which is the 
classical Cantor set. More precisely, let X = {1, 2, . . . , c?} be a finite set called the alphabet. 
Denote by X* the set of all finite words in the alphabet X (including the empty word 0). 
In other terms, X* is the free monoid generated by the set X (the composition being the 
concatenation {w,w') i— > ww'). The associated rooted homogeneous tree T = T{X) is the 
(right) Cayley graph of the free monoid X* (so that one connects w to wx by an edge for 
aWw eX*,xe X). The tree T{X) = X* is split into levels T„ = X"" (the set of words of 
length n). The level Tq consists only of the empty word 0, which is the root of T. Each 
vertex w & T = X* is the root of the subtree which consists of all the words beginning 
with w. The map w' i— >• ww' provides then a canonical identification of the trees T and 
Tu,, see Figure [H where X = {a, b}. 




Figure 1. 



l.B. Generalized permutation matrices. The group = Aut(T) of automorphisms 
of the tree T obviously preserves each level of T. In particular, it acts by permutations 
on the first level X = Ti, i.e., there is a homomorphism g ^ a = from C5 to Sym(X) 
(the permutation group on X) . Another piece of data associated with any automorphism 
g E & \s a. collection of automorphisms {gx}x&x indexed by the alphabet X. Indeed, 
if y = cT^(a;), then g establishes a one-to-one correspondence between the corresponding 
subtrees and Ty rooted at the points x and respectively. Since both subtrees and 
Ty are canonically isomorphic to the full tree T, the map g : ^ Ty is conjugate to 
an automorphism of T denoted by g^ (in terms of Swift's description above, g^ describes 
what happens on the back of a first order flea when it moves from position x to position 
y = a^{g)), see Figure [21 

Conversely, a permutation a G Sym(X) and a collection {gx}xGX of elements of (25 
uniquely determine the associated automorphism g E (3. In algebraic terms it means 
that g I— > (o"^; {gx}xex) is an isomorphism of the group (25 and the semi-direct product 
Sym(X) X (in yet another terminology, & is isomorphic to the permutational wreath 
product & I Sym(X)). There is a very convenient way of visualizing this structure by 
means of generalized permutation matrices. 

Recall that the usual permutation matrix M" associated with a permutation a G 
Sym(X) is a \X\ x \X\ matrix with entries 

^ f 1 , if y = c^(a:), 
\ 5 otherwise ' 
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Figure 2. 

and that the map a ^ M°" is a group isomorphism. In the same way we shall present the 
data ((T^; {gx\x&x) by the generalized permutation matrix with entries 

g.^ , iiy = (r^{x), 



=- 

\ , otherwise. 

For instance, the automorphism g described in Figure [3] is presented by the matrix 

gi 



g2 




Figure 3. 

More generally, given an arbitrary group G, we shall denote by 

Sym(X; G) = Gl Sym(X) = Sym(X) X 

the group of generalized permutation matrices of order |X| with non-zero entries from the 
group G. The group operation here is the usual matrix multiplication, the only difference 
with ordinary permutation matrices being that the matrix elements are multiplied accord- 
ing to the group law of G. Obviously, application of the augmentation map (which consists 
in replacing all group elements with 1) to a generalized permutation matrix yields a usual 
permutation matrix, which corresponds to the natural homomorphism of Sym(X; G) onto 
Sym(X). We can now sum up the above discussion by saying that there is a natural 
isomorphism of the group <& = Aut(T) of automorphisms of the tree T = T{X) and of 
the generalized permutation group Sym(X; C5). It is this isomorphism that embodies the 
self-similarity properties of the group (5. 

I.e. Self-similar groups and matrix presentations. 

Definition. A countable subgroup G C C5 is self-similar if the restriction of the isomor- 
phism 35 Sym{X; to G induces an embedding G Sym(X;G); in other words, 
if all entries of the matrices M^, g E G belong to G. Note that, rigorously speaking, 
self- similarity is a property of the embedding G G (5 rather than of the group G only. The 
embedding G ^ Sym(X; G) need not be surjective (see the example below). 
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Example. The adding machine (isomorphic to the group Z) is generated by the transfor- 
mation a : z z + 1 on the ring Z2 of 2-adic integers eo + £^1 " 2 + ■ ■ ■ + ■ 2" + . . . , 
where the digits Si take values or 1. Depending on the values of initial digits, it acts in 
the following way: 

ei £2 £3 • • • ^ 1 ei £2 £^3 • • • , 

1 £2 ^3 • • • ^ 1 £2 £^3 • • • , 
1 1 £3 • • • ^ 1 £3 • • • , 



We can think of sequences [sq^Ei, . . .) as of boundary points of the binary rooted tree T = 
T{X) of the alphabet X = {0, 1}. The transformation a extends to an automorphism of T, 
and, as one can easily see from its symbolic description above, the associated generalized 
permutation matrix is 

0, 



Thus, the infinite cyclic group (a) = Z generated by the transformation a is self-similar 
(as a subgroup of the full group of automorphisms (S). 

Note that the automorphism a is completely determined by the matrix M". Indeed, 

the augmentation n,ap appi.ed to p.odnces the n.ua. pe...ntat,on n,atn. (J J 

which describes the permutation by which a acts on the first level of the tree T. Further, 
by substituting M° for a and the identity matrix for 1 in M" one obtains the matrix 



/o 
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o\ 
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[a 











augmentation of which produces the order 4 permutation matrix 



/o 
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o\ 











1 





1 
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describing the action of a on the second level of T. By recursively repeating this procedure 
one obtains the action of a on all levels of T, i.e., an automorphism of the whole tree T. 

The example above suggests the following way of defining self-similar groups by their 
matrix presentations. Fix a finite (or countable, for infinitely generated groups) set K 
(the future set of generators of a self-similar group), and assign to each k & K a. general- 
ized permutation matrix whose non-zero entries are words in the alphabet consisting 
of letters from K and their inverses (the entry associated with the empty word is 1). By 
replacing the non-zero entries of matrices M'^ with corresponding products of the asso- 
ciated matrices and their inverses we obtain generalized permutation matrices of order 
|Xp, etc. The usual permutation matrices obtained from them by the augmentation map 
determine then the action of elements from K on all levels of the tree T, i.e., the cor- 
responding automorphisms of T. See |BG00j or |Nek05] for more on recursions of this 
kind. 

A particular case of this construction arises in the situation when all non-zero entries of 
matrices M'^ are elements of the set K. In this case the assignment k, i— > M'^ amounts to 
a map (k, x) (A, y) of the product K x X to itself, i.e., to an automaton. Here y = cr{x) 
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for the permutation a = a{M^) determined by the matrix M^, and A is the matrix entry 
M!^y. The self-similar group obtained in this way is called an automaton group^ 

l.D. The Basilica group. The Basilica group B is determined by the matrix presenta- 
tion 

a 

The aforementioned recursion for this group looks in the following way: 









(0 a\ 








\l 



and the associated automaton is 
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a 0\ 




1 
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n n n 

U U U 


/ 





1 






y 




/o 


b 0\ 







1 




1 









1 0/ 


'(a,l) 


h- 


- 1) 


(a, 2) 


t- 


- (e,2) 


(M) 


h- 


- (a, 2) 


(6,2) 


h- 


- (e,l) 


(e,l) 


t- 


- (e,l) 


.(e,2) 


h- 


- (e,2). 



Here {1,2} = X is the alphabet of the binary tree T, 



group identity determined by the substitution e 



and K 
0' 



{a,b,e}, where e is the 



GZ02al fsee below for its al- 



The group B was first studied by Grigorchuk and Zuk 
gebraic propereties). The name Basilica comes from the fact that it also appears as the 
iterated monodromy group of the rational map z i— > — 1 on the Riemann sphere C 

This latter notion was introduced by Nekrashevych who created a very fruitful link 
between the theory of self-similar groups and rational dynamics. Namely, given a rational 
map (/? : C ^ C of degree d, a generic point z & C has precisely d preimages, each of which 
also has d preimages, etc. Thus, attached to a generic point 2 G C is the rooted homo- 
geneous tree of its preimages. One can move the preimage tree along any continuous 
curve consisting of generic points. However, if z follows a non-contractible loop, it may 
happen that, although the preimage tree returns to its original position, it undergoes 
a certain non-trivial monodromy transformation. Thus, there is a homomorphism of the 
fundamental group of the connected component of z in the set of generic points to the 
group of automorphisms of the preimage tree T^. The resulting subgroup of Aut(T2) is 
called the iterated monodromy group of the map (p, see jNek05] for more details. 



Now, in rational dynamics the map z 



— 1 is called the Basilica map |Bie90j . 



because its Julia set (a subset of the Riemann sphere which, in a sense, consists of the 
limit points of this map) looks similar to Basilica di San Marco in Venice (together with 



Usually, when talking about automata groups, one tacitly assumes that the corresponding automaton 
is finite, i.e., the generating set K is finite. 
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its reflection in the water), see Figure HI This Juha set also arises as the limit set of the 
Basihca group B. 




Figure 4. 



Interesting algebraic and analytic properties of the Basilica group obtained in |GZ02a 



GZ02b , IBG02j made especially relevant the question about its amenability formulated in 



GZ02aj . also see |BGN03] . 



I.E. Bounded automatic automorphisms and Mother groups. The Basilica group 
B actually belongs to a certain natural subclass of the class of self-similar groups which 
we shall now describe. 

As we have seen in Section [l.Bl given an automorphism g ^ & = Aut(T), any sym- 
bol X E X determines an associated automorphism G S5. In the same way such an 
automorphism G S5 (the state of g at the point w) can be defined for an arbitrary 
word w (z T = X*, by restricting the automorphism g to the subtree T^, with the sub- 
sequent identification of both and its image giT^) = Tg(^u}) with T. Equivalently, g^j 
can be obtained by recursively applying the presentation g i— > M^, namely, g^ is the non- 
zero entry of the row R^, of the \X\^^^ x matrix obtained by recursive expansion 
g^ M3 ^ .... 

If the set of states of g 

S{g) = {g^:weT}c<S 

is finite, then the automorphism g is called automatic. The set of all automatic automor- 
phisms of the tree T forms a countable subgroup 21 = oi <3 = C5(X). 

An automorphism g is called bounded if the sets {w G X" : 7^ e} have uniformly 
bounded cardinalities over all n. The set of all bounded automorphisms also forms a 
subgroup 03 = <B(X) of = We denote by 0521 = Q32l(X) = <B(X) n 2t(X) the 

group of all bounded automatic automorphisms of the homogeneous rooted tree T. 

It is easy to see that the generators a, b of the Basilica group B described in Section [l.DI 
are both automatic and bounded in the above sense, so that B C ^21 (here and on several 
occasions below we omit the alphabet X from our notation). More generally, any group 
generated by a (finite) automaton (see Section II. CI) is a subgroup of 21. An automaton 
is called bounded if its group is contained in 03 (therefore, in !B2t). The class of groups 
generated by bounded automata was defined by Sidki in [SidOOj . Obviously, all these groups 
are subgroups of *B2l. Most of the well-studied groups of finite automata belong to this 
class (see |BKN08j for examples). 

Groups generated by bounded automata also appear naturally in connection with fractal 
geometry. It was proved in |BN03j that every such group is contracting, and a contracting 
group is generated by bounded automata if and only if the boundary of its tile is finite. 
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This technical condition imphes that the limit space (see |Nek05] ) of such a group belongs 
to the well studied class of nested fractals (see |Kig01| ). This is the class of fractals on 



which the Brownian motion is best understood. This shows an interesting connection 
between the most well-studied class of self-similar groups and the class of fractals with 
most well-understood analysis (see [NT08j for more details). 

It turns out that the class of groups generated by bounded automata contains a count- 
able family of groups which have certain universality properties with respect to this class. 

Let X be a finite set with a distinguished element o & X, and put X = X \ {o}. Set 
A = Sym(X) and B = Sym{X; A), and recursively embed the groups A and B into ^{X) 
by the matrix presentations 

M" = Ma) , M' = J,^) 

where (f)A{o)i'pB{V) are, respectively, the permutation and the generalized permutation 
matrices corresponding to a G A, 6 G -B. Then the Mother group OJl = OJl(X) = (A, B) is 
the subgroup of 6 = 'S(X) generated by the finite groups A and B. 

A direct verification shows that both groups A, B are contained in whence the group 
!Dt(X) is a subgroup o/*B2l(X). On the other hand, as it was proved in |BKN08j . any 
finitely generated subgroup of ^2l(X) can be embedded as a subgroup into the generalized 
permutation group Sym(X^; 9Jl(X^)) for some integer N . 

Thus, in view of the fact that amenability is preserved by elementary operations (see 
Section [2. CI below), amenability of the groups *B2l(X) for all finite sets X (therefore, 
amenability of all groups generated by bounded automata) would follow from amenability 
just of all the Mother groups 9}l(X). 

It is worth noting that the groups generated by bounded automata form a subclass 
of the class of contracting self-similar groups (see |BN03l INekOS] ). It is still an open 
question whether all contracting groups are amenable. However, Nekrashevych [NekOS] 
recently established a weaker property: contracting groups contain no free groups with 
> 2 generators. 



2. Amenability 

2. A. Prom finite to infinite: von Neumann, Day and Reiter. Finite groups can 
be characterized as those discrete groups which have a finite invariant measure. In other 
words, a discrete group G is finite if and only if the natural action of G by translations on 
the space i{G) of positive normalized elements from i^[G) has a fixed point. This trivial 
observation suggests two ways of "extending" the finiteness property to infinite groups. 
One can look either for fixed points in a bigger space, or for approximative invariance 
instead of exact one. 

The first idea was implemented by John von Neumann |vN29j . according to whose 
definition amenable groups are those which admit a translation invariant mearu. A mean 
on G is a finitely additive probability measure, in other words, an element of the space 
|-£ooj*^ ^ of positive normalized functionals on i°°{G). Usual measures on G are also means, 
but if G is infinite, then there are many more means than measures (which corresponds 
to the fact that in the infinite case is significantly "smaller" than its second dual space 

[e]** = [£^]*). 

■^Actually, the original term used by von Neumann was the German mefihare Gruppe, which means 
"measurable group" in English. It was later replaced in German with mittelbare (cf. moyennable in 
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Means being highly non-constructive objects, the other way was explored (surprisingly, 
much later than the original definition of von Neumann) by Reiter |Rei65j who introduced 
(under the name Pi) what is nowadays known as Reiter 's condition for a group G: there 
exists an approximatively invariant sequence of probability measures on G, in other words, 
there exists a sequence of probability measures A„ on G such that 

||A„ -^A„|| — ^ ^geG , 

where || ■ || denotes the total variation norm. He proved that the above condition is in 
fact equivalent to amenability as defined by von Neumann. 

Example . The sequence of Cesaro averaging measures 

Xn = -^{So + Si + --- + 6n) , 
n + 1 

on the group of integers Z is approximatively invariant (here 6n denotes the unit mass at 
the point n). Thus, Z is amenable. 

2.B. Other definitions. There is a lot of other (equivalent) definitions of amenability of 
a countable group, which illustrates importance and naturalness of this notion. We shall 
briefiy mention just some of them, referring the reader to |Gre69j . |Pie84] and |Pat88] 
for more details. Moreover, the notion of amenability has been extended to objects other 
than groups, in particular, to group actions, equivalence relations, and, more generally, 
to groupoids (e.g., see |ADR00j ). 

The main application of the notion of amenability is its characterization as a fixed point 
property. Namely, a countable group G is amenable if and only if any continuous affine 
action of G on a compact space has a fixed point. An example of such an action arises 
in the following way. Let X be a compact topological space endowed with a continuous 
action of G, and let V{X) denote the space of probability measures on X endowed with 
the weak* topology. Then V{X) has a natural affine structure, and the action of G extends 
to a continuous affine action on V{X). Therefore, any continuous action of an amenable 
group on a compact space has a finite invariant measur^im fact, this property can also 
be shown to be equivalent to amenability). In the case of the group of integers Z this 
result is known as the Krylov-Bogolyubov theorem jKB37j (which is one of the starting 
points of the modern theory of topological dynamical systems). 

Yet another characterization of amenable groups can be given in terms of their isoperi- 
metric properties. This condition is basically a specialization of Reiter's condition to 
sequences of measures of special kind (although historically it was introduced by F0lner 
|F0l55| some 10 years before Reiter). Let An be a sequence of finite subsets of G, and 
let A„ be the associated uniform probability measures on A^. Then Reiter's condition for 



French), literally meaning "averageable" . In English, however, Mahlon M. Day suggested to use (appar- 
ently, first as a pun) the word amenable, which appeared in print in this context for the first time in 
1949 |Day49| . It is curious that Day himself, when he later described the history of this term in |Day83| 
on the occasion of the nomination of his paper |Day57| as a "Citation Classic" dated its appearance to 
1955: In 1929, von Neumann studied a new class of groups, those with invariant means on the hounded 
functions. My thesis (1939) studied semigroups with invariant means; thereafter, I worked in the field 
alternately with the geometry of Banach spaces. I finished a large geometrical project in 1955 and turned 
back to invariant means; in order to talk to my students I invented the term 'amenable (pronounced as 
amean'able) semigroups'. 

^Thc first proof of this fact by Bogolyubov |Bog39| published in 1939 (immediately after [KB37| ) in a 
rather obscure journal in Ukrainian remained almost unknown, see |Ano941 [CSGdlH99] . 
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the sequence A„ is equivalent to the following condition on the sets A^. 

— -— ^ ygeG, 

where A denotes the symmetric difference of two sets, and |v4| is the cardinality of a finite 
set A. A sequence of sets An satisfying the above condition is called a F0lner sequence, 
and the condition itself is called F0lner's condition. Obviously, F0lner's condition implies 
Reiter's condition; however, the usual "slicing" isoperimetric techniques also allow one to 
prove the converse, so that F0lner's condition is equivalent to amenability. 

For finitely generated groups F0lner's condition takes especially simple form. Indeed, 
in this case it is enough to verify it for the elements g from a finite generating set K G G 
only. Let us assume that K is symmetric, and denote by F = T{G,K) the (left) Cayley 
graph of the group G determined by K (i.e., the vertex set is G, and the edges are of the 
form {g, kg) with g & G and k G K^. For a set A C G denote by OA = Ok A C A its 
boundary in the Cayley graph, i.e., the set of all points from A which have a neighbor 
from the complement of A. Then a sequence of sets An C G is F0lner if and only if 




Existence of a sequence of sets as above is an isoperimetric characterization of amenability. 

Example. For the group Z with the standard generating set {±1} the boundary of the 
segment An = {0, 1, 2, . . . , n} consists of two points {0,n}, whereas \An\ — > oo, so that 
{An} is a F0lner sequence. 

2.C. Elementary groups. The class of amenable groups is closed with respect to the 
"elementary" operations of taking subgroups, quotients, extensions and inductive limits. 
Finite and abelian groups are amenable (cf. Example^. The minimal class of groups 
containing finite and abelian groups and closed with respect to the above elementary 
operations is called elementary amenable (EA) |Day57| . 

In the above paper Day asked the question whether every amenable group is elementary 
amenable. The first example of an amenable but not elementary amenable group is the 
group of intermediate growth (see below) found by Grigorchuk [GriSOl Rjri85j . Later, a 
finitely presented amenable extension of the Grigorchuk group was constructed in |Gri98j . 

However, there is yet another way to obtain "obviously amenable" groups. It is related 
with the notion of growth. Let G be a finitely generated group with a symmetric gener- 
ating set K. Denote by Bn the n-ball of the Cayley graph metric on G centered at the 
group identity, or, in other words, the set of all elements of G which can be presented as 
products of not more than n generators from K. The sequence \Bn\ is submultiplicative, 
so that there exists a limit limlog |-B„|/n. The group G is said to have exponential or 
subexponential growth depending on whether this limit is positive or zero (this property 
does not depend on the choice of a generating set K). 

The class of groups of subexponential growth contains all the groups of polynomial 
growth (the ones for which \Bn\ is bounded from above by a polynomial function; by a 
theorem of Gromov |Gro81] these are precisely finite extensions of nilpotent groups), but 
there are also examples of groups of intermediate growth, i.e., the ones whose growth is 
subexponential without being polynomial. First examples of this kind were constructed 
by Grigorchuk jGriSS] . and these groups can often be realized as self-similar groups (see 

^Elsewhere in this paper wc shall always deal with the right Cayley graphs. However, in order to keep 
the notations consistent, here it is more convenient to consider the left Cayley graphs. 
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|BGN03j ). For instance, the most famous of the Grigorchuk groups has 4 generators 
acting on the rooted binary tree with the matrix presentation 

fO 1\ ^ fa 0\ fa 0\ J A 0\ 

^^^1 oj ' ^""^0 c) ' "^^0 d) ' ^^^0 b) ■ 

If the growth of G is subexponential, then the sequence Bn necessarily contains a F0lner 
subsequence, so that the groups of subexponential growth are amenable. Therefore, one 
can change the definition of elementary amenable groups by extending the set of "build- 
ing blocks": the minimal class of groups containing finite, abelian and subexponential 
groups and closed with respect to the elementary operations is called subexponentially 
elementary amenable (SEA). Thus, a natural goal is to find amenable groups which are 
not subexponentially elementary (see jGri98l [CSGdlH99] ). 

It is the Basilica group B which provided the first example of this kind. It was shown 
in |GZ02aj that it does not belong to the class SEA, whereas it was proved in |BV05j that 
the Basilica group is amenable. We shall now explain the role of random walks in the 
proof of amenability of B. 

Remark. It is worth mentioning at this point that non-amenable groups do exist, and there 
is actually quite a lot of them (for instance, numerous matrix groups: non-elementary 
Fuchsian and Kleinian groups, lattices in semi-simple Lie groups, etc.). The first example 
is of course the free group Td with d > 2 generators (since any discrete group can be 
obtained from free groups by the elementary operations above, amenability of free groups 
would have implied amenability of all groups). Non- amenability of !Fd may be explained 
in many different ways by using various definitions. For instance, it is not hard to build 
a paradoxical decomposition of JF^ (see |CSGdlH99] and the references therein), which 
prevents it from having an invariant mean. Another way consists in noticing that there 
are continuous actions of J^a on compact sets admitting no finite invariant measures. Let, 
for instance d = 2. Then an action of JF2 on a compact K is determined by specifying two 
homeomorphisms of K corresponding to the generators of JF2. If K is the circle, then the 
only measure preserved by any irrational rotation is the Lebesgue measure. Take such 
a rotation for the first homeomorphism from the definition of the action. Then, if we 
choose the second homeomorphism in such a way that it does not preserve the Lebesgue 
measure, then these two homeomorphisms (therefore, the associated action of JF2) do not 
have any invariant measure, so that JF2 is not amenable. 



3. Random walks 

3. A. Convolution powers. The main idea behind the use of random walks for estab- 
lishing amenability at first glance looks counterproductive. Let us replace arbitrary ap- 
proximatively invariant sequences of probability measures A.„ on G from Reiter's condition 
with sequences of very special form, namely, with sequences of convolution powers /i*" of 
a single probability measure fi on G. In the same way as with F0lner's characterization 
of amenability (see Section 12. Bl) , it turns out that this restricted class of approximatively 



invariant sequences is still sufficient in order to characterize amenability. More precisely, 
a group G is amenable if and only if there exists a probability measure fi on G such that 
the sequence of its convolution powers /i*" is approximatively invariant. In one direction 
(the one we need for proving amenability) this is just a particular case of Reiter's con- 
dition, whereas in the opposite direction it was conjectured by Furstenberg [Fur 73] and 
later independently proved by Kaimanovich-Vershik |VK79l IKV83] and by Rosenblatt 
|R,os81j . 
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Now, working with sequences of convolution powers instead of arbitrary sequences 
of probability measures on G is actually easier, because of their description as one- 
dimensional distributions of random walks on G. Moreover, one can use a very powerful 
quantitative criterion of whether a given sequence of convolution powers is approxima- 
tively invariant. It is provided by the entropy theory of random walks which we shall 
briefly describe below. 

Random walks are in a sense the most homogeneous Markov chains: they are homoge- 
neous both in space and in time. The latter property means that the assignment x ^ ir^ 
of transition probabilities to the points from the state space is equivariant with respect 
to a group action, and the simplest instance of such an action is, of course, the action of 
a group on itself. 

More formally, the (right) random walk on a countable group G determined by a proba- 
bility measure /i is the Markov chain with the state space G and the transition probabilities 

equivariant with respect to the left action of the group on itself. In other words, from 
a point g the random walk moves at the next moment of time to the point gh, where 
the random increment h is chosen according to the distribution /i. We shall use for this 
description of transition probabilities of the random walk (G, /x) the notation 

g ^7 — ' gh . 

Thus, if the random walk starts at moment from a point g^, then its position at time n 
is 

gn = gohlh2 ... /in , 

where hi is a Bernoulli sequence of independent //-distributed increments. Therefore, the 
distribution of the position gn of this random walk at time n is the translate gofi*^ of 
the n-fold convolution power fi*"' of the measure /i (the convolution of two probability 
measures /ii,/i2 on G is defined as the image of the product measure /xi ® /i2 on G x G 
under the map (5^1,5^2) ^ gm)- 

3.B. Trivial future. Reiter's condition 

||^^-_^-|| _ WgeG 

for the sequence of convolution powers /i*" means that the time n one-dimensional dis- 
tributions of the random walk issued from the group identity e and from an arbitrary 
point g & G asymptotically coincide. Probabilistically, the fact that the one-dimensional 
distributions of a Markov chain asymptotically do not depend on their starting points 
means that the "remote future" (behaviour at infinity) of the chain does not depend on 
its present, which, in view of the classical Kolmogorov argument means that the "remote 
future" must be trivial. 

In order to explain the latter notion in a more rigorous way, let us look at two examples: 
the simple random walks on the 2-dimensional integer lattice (= the Cayley graph of the 
group Z^) and on the homogeneous tree of degree 4 (= the Cayley graph of the free group 
with 2 generators). The simple random walk on a graph is the one whose transition 
probabilities are equidistributed among neighbours; the simple random walk on the Cayley 
graph of a group is precisely the random walk on this group determined by the probability 
measure /i = fix equidistributed on the generating set K. Locally, each of these graphs 
is regular of degree 4 (each point has 4 neighbours), but their global geometry is very 
different, see Figure [5l In particular, the Cayley graph of JF2 is endowed with a natural 
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boundary dJ^2 which can, for instance, be identified with the space of all paths without 
backtracking issued from the identity of the group (or any other reference point). 



Figure 5. 



The global behaviour of sample paths of simple random walks on these graphs is also 
very different. Sample paths of the simple random walk on JF2 converge a.s. to the 
boundary dJ^2 (of course, different sample paths may converges to different limits). Thus, 
these limit points can be used to distinguish sample paths by their behaviour at infinity. 
On the other hand, although sample paths of the random walk on 1? are quite complicated 
(their scaling limit is the Brownian motion on the plane), they all look "the same", see 
Figure El 




Figure 6. 



Formally speaking, the "remote future" of a Markov chain is described by its tail a- 
algebra 

n 

which is the limit of the decreasing sequence of a-algebras Sljf generated by the positions 
of the chain at times > n. Thus, the boundary convergence of sample paths of the simple 
random walk on ^-"2 at once implies non-triviality of its tail a-algebra. On the other hand, 
in spite of absence of any visible behavior at infinity, proving triviality of the tail a-algebra 
for the simple random walk on 7? requires additional work. 

On a formal level a criterion of the triviality of the tail a-algebra of an arbitrary Markov 
chain is provided by the corresponding 0-2 law jDer76l IKai92] . Its "zero part" for the 
random walk on a group G determined by a probability measure fi takes the following 
form (see |KV83j ): the tail u-algebra of the random walk is trivial if and only if 

y^*n _ ^*(n+l) || ^ q G SUpp . 

n— +00 



14 



VADIM A. KAIMANOVICH 



Thus, if the support of the measure jj, generates G as a group, and the tail a-algebra of 
the random walk (G, fi) is trivial, then the sequence of Cesaro averages of the convolution 
powers satisfies Reiter's condition, and therefore the group G is amenable (actually, if yU. 
is aperiodic, then the sequence of convolution powers /i*" also satisfies Reiter's condition). 

3.C. Asymptotic entropy. Let p = (pi) be a discrete probability distribution. By 
Shannon the amount of information contained in any outcome i is — logpj. The average 
amount of information per outcome 

Hip) = -^Pilogpi 

is called the entropy of the distribution p. The entropy has the following two properties 
(by which it is essentially characterized, see [YY83] ): 

• monotonocity: Hp' is a quotient of the distribution p (i.e., p' is obtained by gluing 
together some of the states of p), then 

H{p') < H{p) , 

• additivity: for any two distributions pi,P2 the entropy of their product is 

H{pi^P2) = H{pi)+H{p2) . 

Let fihe a probability measure on a countable group G with finite entropy H{fi). The 
(asymptotic) entropy of the random walk (G, /i) is defined as 

h{G,fi) = lim 



n 

in other words, h(G, n) is the asymptotic mean specific amount of information about 
a single increment hi in the product Qn = /ii/i2 • • • ^n- Existence of the above limit 
follows from the fact that the sequence H{fi*'"') is subadditive. Indeed, by the definition of 
convolution, the measure ^*("+"^) is a quotient of the product measure fi*"' (8> fi*"^, whence 

by the above monotonicity and additivity properties. 

The asymptotic entropy was first introduced by Avez [Ave72j . As it turned out, it plays 
a crucial role in understanding the asymptotic properties of the random walk {G,fi). 
Namely, as it was independently proved by Kaimanovich-Vershik |VK79t IKV83j and 
Derriennic |Der80j . the asymptotic entropy h{G,fi) vanishes if and only if the tail a- 
algebra of the random walk {G,fi) is trivial. 

This result means that the "remote future" of the random walk (G, /i) is trivial if and 
only if the amount of information about the first increment hi in the random product 
Qn = hih2 . . .hn asymptotically vanishes. Indeed, on a more rigorous level, this amount 
of information is the mean conditional entropy (see |Roh67j ) H {hi\gn) of hi with respect 
to gn. It can be easily seen to coincide with the difference H (//*") — H (/i**^""^)) between 
the entropies of two consecutive convolution powers, whence the claim. 

3.D. Self-similarity and RWIDF. A random walk with internal degrees of freedom 
(RWIDF) on a group G is a Markov chain whose state space is the product of G by 
a certain space X (the space of degrees of freedom) , and its transition probabilities are 
equivariant with respect to the action of G on itself. Thus, the transition probabilities 
are 

(fif,x) I > [gh,y) , 



SELF-SIMILARITY AND RANDOM WALKS 



15 



where M = {nxy '■ x,y & X} is a |X| x \X\ matrix of subprobability measures on G such 
that 

J2\\l^xy\\ = l VxeX. 

y 

The image of the RWIDF [G, M) under the map {g, x) h-> x is the quotient Markov 
chain on X with the transition matrix 

P = (Pxy) , Pxy = Wf^xyW , 

which is the image of the matrix M under the augmentation map. 

There is a natural hnk between random walks on self-similar groups and random walks 
with internal degrees of freedom. Let // be a probability measure on a self-similar group 
G. Then the associated random walk is 

g I > gh . 

By using the self-similar embedding G Sym{X; G) it gives rise to the random walk on 
the generalized permutation group Sym(X; G) with the transition probabilities 

Since multiplication of the matrix G Sym(X; G) by the increment is done row by 
row, we obtain the following Markov chain on the space of these rows: 

R RM^ . 

Due to the definition of the group Sym(X; G) the rows of the corresponding matrices can 
be identified with points of the product space G x X (each row has precisely one non-zero 
entry, so that it is completely described by the value of this entry and by its position). 
Therefore, the latter Markov chain can be interpreted as a Markov chain on G x X whose 
transition probabilities are easily seen to be invariant with respect to the left action of G 
on G X X, clS 8b random walk on G with internal degrees of freedom parameterized 
by the alphabet X. This RWIDF is described by the transition probabilities matrix 

a 

Recall that stopping a Markov chain at the times when it visits a certain recurrent 
subset of the state space produces a new Markov chain on this recurrent subset (the trace 
of the original Markov chain). For a random walk with internal degrees of freedom on 
G X X determined by a matrix M take for such a subset the copy G x {x} of the group 
G obtained by fixing a value x & X. It is recurrent provided the quotient chain on X is 
irreducible. The transition probabilities of the induced chain on G x {x} are obviously 
equivariant with respect to the left action of the group G on itself (because the original 
RWIDF also has this property). Therefore, the induced chain on G x {x} is actually the 
usual random walk determined by a certain probability measure fi^ on G. 

The measures n^, x & X can be expressed in terms of the matrix M as 

= fi^x + M^^ (/ - M^)"^ , 

where M^x (resp., M^^) denotes the row {^ixy)y^x (resp., the column {tiyx)y^x) of the 
matrix M with the removed element ^xx-, and M^x is the (c? — 1) x (rf — 1) matrix (where 
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d = \X\) obtained from M by removing its x-th row and column. The multiplication 
above is understood in the usual matrix sensed 

This is elementary probability. We look at the quotient chain on X and replace its 
transition probabilities p^y with the transition measures fixy in the identity 

oo 

1 — Pxx + ^ ^ ^ ^ PxyoPyoyi ' ' ' Pyn-iynPynX 
n=0 yo,...,y„^x 

Pxx ~l~ Pxx (-^ ~l~ Ptx ~I~ Pxx "!"•••) Pxx 

Pxx ~l~ Pxx (-^ Pxx) Px 



XX 1 



which yields 

oo 

fJ' = fJ'xx + ^ ^ ^ ] f^xyof^yoyi ' ' ' fJ'yn-iynf^ynX 
n=0 yo,...,y„j^x 

= /i,, + (/ + + M4 + . . . ) 

= fixx + Mxx {I - Mxx)~^ Mxx ■ 

The first term in this formula corresponds to staying at the point x (and performing on 
G the jump determined by the measure ^xx)i whereas in the second term the first factor 
corresponds to moving from x to X \ {x}, the second one to staying in X \ {x} (each 
matrix power corresponding to staying in X \ {x} for precisely n steps), and the 
third one to moving back from X \ {x} to the point x. The matrix notation automatically 
takes care of what is going on with the G-component of the RWIDF. 

The measures /i^ also admit the following interpretation in terms of the original random 
walk (G, /i) with the sample paths {gn)'- we look at it only at the moments n when gn{Tx) = 
Tx, and /i^ is then the law of the induced random walk on the group Aut(Tj.) = Aut(T). 

3.E. The Miinchhausen trick. Let us now look at what happens with the asymptotic 
entropy in the course of the transformations 

Since the information contained in a matrix does not exceed the sum of informations 
about each row, passing to asymptotic entropies we obtain the inequality 

h{G,fi) < dh{G,M) . 

Further, all points x G X are visited by RWIDF [G, M) with the same asymptotic fre- 
quency 1/d (because the uniform distribution is stationary for the quotient chain on X), 
whence 

/i(G',/i^) =dh{G,M) > h{G,fi) . 

The idea of the Miinchhausen consists in combining two observations. The 

first one is the above entropy inequality. The second observation is that if the measure 
/i^ is a non-trivial convex combination 

/i"" = (1 — a)6e + afi , < a < 1 

of the original measure fi and the (5-measure at the identity of the group (in which case 
we call the measure /i self- similar), then 

/i(G,/i^) = ah{G,fi) . 



it is pointed out in |GN07| , this formula corresponds to the classical operation of taking the Schur 
complement of a matrix. 
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The result of these two observations is the inequahty 

h{G, ji) < ah{G, /i) . 

Taken into account that < a < 1, it is only possible if h{G,fi) = 0, which proves 
amenability of the group G. 

We shall now show how the Miinchhausen trick works in two particular cases: for the 
Basilica group B and for the Mother groups 9Jt. 

3.F. The Basilica group. As we have seen in Section fl.Dl the Basilica group is defined 
by the matrix presentation 

'b 0\ ^ fO a 

1' ^^10 



B : a 



Take on i5 a symmetric probability measure fi supported by the generators a, b and their 
inverse^: 

H = a{a + a^^) + P (b + b~^) , 
for a,f3 > such that 2(a + (3) = 1. The matrix associated with the measure /i is 

a{b + b~^) (3{l + a 

Therefore, the trace of the RWIDF [B, M^) on the copy of B corresponding to the first 
letter 1 of the 2-letter alphabet {1,2} is the random walk governed by the measure 

/i = yU^ = /ill + Ail2 (1 - /^22)~^ /i21 

= a{b + b-') + ^(1 + a)(l + a^i) = (3+^{a + a^') + a{b + b^') . 

If 

a P 
then 

/I = /?+(l-/3)/i, 

so that the measure /i is self-similar, and Miinchhausen 's trick is applicable. 

3.G. The Mother group. For proving amenability of the Mother group DJl (here and 
below we omit the alpabet X) one can apply an approach somewhat different from the 
one which was used above for the Basilica group. It is based on the fact that the Mother 
group 071 is generated by two finite subgroups A and B. We take for the measure /i the 
convolution product of the uniform measures and on these subgroups: 

/i = rriA * rriB ■ 



2a^ = 0' 



"^It is named after venerable Baron Miinchhausen who . . once rode on a cannon-hall and next told 
about it. Another time he reported he had to get himself and the good horse he sat on, out of a quagmire 
by pulling his own hair till he saved himself and his horse ..." 

®For simphcity we pass from now on to the group algebra notations putting g for a (5- measure 5g, g ^ e 
and just 1 for the (5-mcasure 5e concentrated at the identity of the group. 
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Then the matrix M'^ has a very special form 



/is 
fiAEd~i 



where d = \X\, and Ed denotes the order d matrix with entries 1/d, so that has 
identical rows with entries 



xy 



fis/d ify = o, 
fiA/d otherwise . 



It means that transition probabilities of the associated RWIDF {DJl x X, M^) do not 
depend on x, so that its projection to 971 is just the random walk (OJl, /I) determined by 
the measure 

~ ^ ..u d-1 1 

y 

whereas the projection of RWIDF (071 x X, M^) to X is the sequence of independent 
X- valued random variables with uniform distribution on X (because all entries M^f,, have 
mass 1/d). Thus, 

/i(nJt,/i) < dhim,]!) . 

The measure /I is a convex combination of the idempotent measures fiA and /i^, so that 
its convolution powers are essentially convex combinations of the convolution powers of 
/i. The total number of m^'s in the n-fold convolution of /I is ~ n/d, but some of them 
disappear because niBmB = mB, so that in fact /I*" is the convolution of about 
copies of /i = rriA * mB- Thus, 



n 



whence h{n, fi) = 0. 



h{m,fi) = ^h{m,fi) 
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